cd/entity/David Wangยท homeโ€บ entitiesโ€บ David Wang
grep -l @david wang /news/*.json | wc -l โ†’ 1

@David Wang

mentions 1 type Person feed RSS
23:22
2026-06-11
modal.com
large-language-models

Making FlashAttention-4 faster for inference

Modal AI engineers Charles Frye and David Wang optimized FlashAttention-4 for large language model inference, focusing on decode-heavy workloads dominated by memory bandwidth-limited token generation.โ€ฆ

// co-occurs with top 3 entities